The Impact of Data Perturbation Techniques on Data Mining
نویسندگان
چکیده
Data perturbation is a data security technique that adds 'noise' to databases to allow individual record confidentiality. This technique allows users to ascertain key summary information about the data while preventing a security breach. Four bias types have been proposed which assess the effectiveness of such a technique. However, these biases deal with simple aggregate concepts (averages, etc.) found in the database. In e-commerce applications, organizations are interested in applying data mining approaches to databases to discover additional knowledge about customers. In our study, we propose a fifth type of bias that may be added by perturbation techniques (Data mining Bias), and empirically test for its existence. Our results find support for this bias, and propose future research avenues that are appropriate for the emerging interdisciplinary field in data security, e-commerce and data mining.
منابع مشابه
A Case Study of the Impact of Parental Diseases on the Probability of Hypertension Using Data Mining Techniques
Introduction: Hypertension is one of the most common health problems. As it has a major impact on other serious diseases such as cardiovascular diseases and strokes, and due to not having any specific symptoms, it is known as a silent killer. Therefore, proper diagnosis, control, and treatment of hypertension is crucial in health care systems and will indeed prevent the development of the other...
متن کاملA Case Study of the Impact of Parental Diseases on the Probability of Hypertension Using Data Mining Techniques
Introduction: Hypertension is one of the most common health problems. As it has a major impact on other serious diseases such as cardiovascular diseases and strokes, and due to not having any specific symptoms, it is known as a silent killer. Therefore, proper diagnosis, control, and treatment of hypertension is crucial in health care systems and will indeed prevent the development of the other...
متن کاملUsing data mining techniques for predicting the survival rate of breast cancer patients: a review article
This review was conducted between December 2018 and March 2019 at Isfahan University of Medical Sciences. A review of various studies revealed what data mining techniques to predict the probability of survival, what risk factors for these predictions, what criteria for evaluating data mining techniques, and finally what data sources for it have been used to predict the surv...
متن کاملExtracting the Hidden Patterns Affecting Mental Health through Data Mining Techniques
Background and Objective: This study was conducted to shed light on the hidden relationships, trends, and patterns of the teenagers’ mental health dataset based on data mining techniques. Materials and Methods: The proposed method has four parts as follows: data preprocessing, data cleaning, target class selection, and extracting rules. The classes included inappropriate, moderate, and accepta...
متن کاملCredit scoring in banks and financial institutions via data mining techniques: A literature review
This paper presents a comprehensive review of the works done, during the 2000–2012, in the application of data mining techniques in Credit scoring. Yet there isn’t any literature in the field of data mining applications in credit scoring. Using a novel research approach, this paper investigates academic and systematic literature review and includes all of the journals in the Science direct onli...
متن کاملEffects of Drying Temperature and Aggregate Shape on the Concrete Compressive Strength: Experiments and Data Mining Techniques
The main purpose of this paper is to assess the impact of the geometry and size of the aggregate, as well as the drying temperature on the compressive strength of the ordinary concrete. To this end, two aggregates with sharp and round corners were prepared in three different aggregate sizes. After preparing concrete samples, the drying operations were carried out in the vicinity of room tempera...
متن کامل